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ERROR REPORT 



BIOTEc^iNOLOGY 
SYSTEMS 
BRANCH 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 



Source: 



Date Processed by STIC: 



/>or O 9 



RECEIVED 

SEP 0 9 2002 

TECH CENTER 1600/2900 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-41 19 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-41 19 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: v 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR§§ 1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

i http://www.uspto.gov/web/offices/pac/checker 




Scq 



uencc 



Listing Error 




mary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER 




Wrapped Nucleics 



Wrapped Aminos 

Incorrect Line Length 

Misaligned Amino Acid 
Numbering 

Non-ASCII 
Variable Length 



Patentln ver. 2.0 "bug" 



Skipped Sequences 
(OLD RULES) 



Skipped Sequences 
(NEW RULES) 

Use of n's or Xaa's 
(NEW RULES) 

Use of "Artificial" 
(NEW RULES) 

Use of <220>Feature 
(NEW RULES) 

Patentln ver. 2.0 "bug" 



-EASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 
The numberAext at the end of each line "wrapped" down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 

The amino acid numberAext at the end of each line "wrapped " down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 

The rules require that a line not exceed 72 characters in length. This includes spaces. 

The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs 
between the numbering. It is recommended to delete any tabs and use spacing between the numbers. 

This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 

Sequence(s) contain n's or Xaa's which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) feature section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequence(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223> 
sections for Artificial or Unknown sequences. 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(i) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence is intentionally skipped 

Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s). 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Use of "Artificial" only as "<21 3> Organism" response is incomplete, per 1 .823(b) of New Sequence Rules. 
Valid response is Artificial Sequence. 

Sequence(s) are missing the <220>Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <213>ORGANISM is "Artificial Sequence" or "Unknown" 
Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of new Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted 

file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). 

Instead, please use "File Manager or any other means to copy file to floppy disk. 
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PCT 



RAW SEQUENCE LISTING DATE: 05/01/2001 

PATENT APPLICATION: US/09/807,802 TIME: 11:39:02 



Mot CompW 



Input Set : A:\sequencelist.txt i/orte 
Output Set: N:\CRF3\05012001\I807802.raw Qoff©^®^ ^ % 

4 <110> APPLICANT: Wilson, James M. ^ e.^ P'iP ^" 

5 Xiao, Weidong 

6 The Trustees of the University of Pennsylvania 

8 <120> TITLE OF INVENTION: Adeno-Associated Virus Serotype I Nucleic Acid 

9 Sequences, Vectors and Host Cells Containing Same 
11 <130> FILE REFERENCE: GNVPN . 031PCT 

C--> 13 <140> CURRENT APPLICATION NUMBER: US/09/807,802 
C--> 14 <141> CURRENT FILING DATE: 2001-04-16 

16 <150> PRIOR APPLICATION NUMBER: 60/107,114 

17 <151> PRIOR FILING DATE: 1998-11-05 
19 <160> NUMBER OF SEQ ID NOS : 20 

21 <170> SOFTWARE: Patentln Ver . 2.0 



ERRORED 


SEQUENCES 






23 


<210> 


SEQ ID NO 


: 1 




24 


<211> 


LENGTH: 4718 




25 


<212> 


TYPE: DNA 






26 


<213> 


ORGANISM: 


AAV-1 




28 


<220> 


FEATURE : 






29 


<221> 


NAME/KEY : 


CDS 




30 


<222> 


LOCATION: 


(335) . . 


(2206) 


32 


<220> 


FEATURE : 






33 


<221> 


NAME/KEY : 


CDS 




34 


<222> 


LOCATION: 


(2223) . 


. (4430) 


36 


<400> 


SEQUENCE: 


1 





37 ttgcccactc cctctctgcg cgctcgctcg ctcggtgggg cctgcggacc aaaggtccgc 60 

40 agacggcaga gctctgctct gccggcccca ccgagcgagc gagcgcgcag agagggagtg 120 

43 ggcaactcca tcactagggg taatcgcgaa gcgcctccca cgctgccgcg tcagcgctga 180 

46 cgtaaattac gtcatagggg agtggtcctg tattagctgt cacgtgagtg cttttgcgac 240 

49 attttgcgac accacgtggc catttagggt atatatggcc gagtgagcga gcaggatctc 300 

52 cattttgacc gcgaaatttg aacgagcagc agcc atg ccg ggc ttc tac gag ate 355 

54 . Met Pro Gly Phe Tyr Glu lie 



55 






















1 








5 






57 


gtg 


ate 


aag 


gtg 


ccg 


age 


gac 


ctg 


gac 


gag 


cac 


ctg 


ccg 


ggc 


att 


tct 


403 


59 


Val 


lie 


Lys 


Val 


Pro 


Ser 


Asp 


Leu 


Asp 


Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 




60 






10 










15 










20 










62 


gac 


teg 


ttt 


gtg 


age 


tgg 


gtg 


gec 


gag 


aag 


gaa 


tgg 


gag 


ctg 


ccc 


ccg 


451 


64 


Asp 


Ser 


Phe 


Val 


Ser 


Trp 


Val 


Ala 


Glu 


Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 




65 




25 










30 










35 












67 


gat 


tct 


gac 


atg 


gat 


ctg 


aat 


ctg 


att 


gag 


cag 


gca 


ccc 


ctg 


acc 


gtg 


499 


69 


Asp 


Ser 


Asp 


Met 


Asp 


Leu 


Asn 


Leu 


He 


Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 




70 


40 










45 










50 










55 




72 


gec 


gag 


aag 


ctg 


cag 


cgc 


gac 


ttc 


ctg 


gtc 


caa 


tgg 


cgc 


cgc 


gtg 


agt 


547 


74 


Ala 


Glu 


Lys 


Leu 


Gin 


Arg 


Asp 


Phe 


Leu 


Val 


Gin 


Trp 


Arg 


Arg 


Val 


Ser 





75 60 65 70 
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RAW SEQUENCE LISTING DATE: 
PATENT APPLICATION: US/09/807 f 802 TIME: 

Input Set : A:\sequencelist.txt 

Output Set: N:\CRF3\05012001\I807802.raw 



05/01/2001 
11:39:02 



77 


day 


y^u 


cv-y - 


gag 


ci f~* r~< 






LLL 




Cay 




gag 


day 


ggc 


its /*t 

gag 


cot. 




7 Q 


.uys 


nld 


D ~»™ /~s 

riu 




A .La 


Leu 


tr ne 


trie 


vai 


C 1 n 

uin 


rfle 


bill 


jjys 


C~\ XT 

Gly 


blU 


O V 

ber 




80 








75 










80 










85 








82 


tac 


ttc 


cac 


etc 


cat 


att 


ctg 


gtg 


gag 


acc 


acg 


ggg 


gtc 


aaa 


tec 


atg 


643 


84 


Tyr 


Phe 


His 


Leu 


His 


He 


Leu 


Val 


Glu 


Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 




85 






90 










95 










100 










87 


gtg 


ctg 


ggc 


cgc 


ttc 


ctg 


agt 


cag 


att 


agg 


gac 


aag 


ctg 


gtg 


cag 


acc 


691 


89 


Val 


Leu 


Gly 


Arg 


Phe 


Leu 


Ser 


Gin 


He 


Arg 


Asp 


Lys 


Leu 


Val 


Gin 


Thr 




90 




105 










110 










115 












92 


ate 


tac 


cgc 


ggg 


ate 


gag 


ccg 


acc 


ctg 


ccc 


aac 


tgg 


ttc 


gcg 


gtg 


acc 


7,39 


94 


He 


Tyr 


Arg 


Gly 


He 


Glu 


Pro 


Thr 


Leu 


Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 




95 


120 










125 










130 










135 




97 


aag 


acg 


cgt 


aat 


ggc 


gec 


gga 


ggg 


ggg 


aac 


aag 


gtg 


gtg 


gac 


gag 


tgc 


787 


99 


Lys 


Thr 


Arg 


Asn 


Gly 


Ala 


Gly 


Gly 


Gly 


Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 





100 










140 










145 










150 




102 


tac 


ate 


CCC 


aac 


tac 


etc 


ctg 


ccc 


aag 


act 


cag 


ccc 


gag 


ctg 


cag 


tgg 


104 


Tyr 


He 


Pro 


Asn 


Tyr 


Leu 


Leu 


Pro 


Lys 


Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


105 








155 










160 










165 






107 


gcg 


tgg 


act 


aac 


atg 


gag 


gag 


tat 


ata 


age 


gee 


tgt 


ttg 


aac 


ctg 


gec 


109 


Ala 


Trp 


Thr 


Asn 


Met 


Glu 


Glu 


Tyr 


He 


Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Ala 


110 






170 










175 










180 








112 


gag 


cgc 


aaa 


egg 


etc 


gtg 


gcg 


cag 


cac 


ctg 


acc 


cac 


gtc 


age 


cag 


acc 


114 


Glu 


Arg 


Lys 


Arg 


Leu 


Val 


Ala 


Gin 


His 


Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


115 




185 










190 










195 










117 


cag 


gag 


cag 


aac 


aag 


gag 


aat 


ctg 


aac 


ccc 


aat 


tct 


gac 


gcg 


cct 


gtc 


119 


Gin 


Glu 


Gin 


Asn 


Lys 


Glu 


Asn 


Leu 


Asn 


Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


120 


200 










205 










210 










215 


E--> 122 


ate 


egg 


tea 


aaa 


acc 


tec 


gcg 


cgc 


tac 


atg 


gag 


ctg 


gtc 


ggg 


tgg 


ctg 


123 
124 


1 no7 




























& 


He 


Arg 


Ser 


Lys 


Thr 


Ser 


Ala 


Arg 


Tyr 


Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


125 










220 










225 










230 




E--> 127 


gtg 


gac 


egg 


ggc 


ate 


acc 


tec 


gag 


aag 


cag 


tgg 


ate 


cag 


gag 


gac 


cag 


128 


1 r\n c 






























j. \j / _j 
































129 


Val 


Asp 


Arg 


Gly 


He 


Thr 


Ser 


Glu 


Lys 


Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 



E--> 



E--> 



E--> 



E--> 



130 
132 
133 
134 
135 
137 
138 
139 
140 
142 
143 
144 
145 
147 
148 



235 

gee teg tac ate 

1123 ' 

Ala Ser 



tec ttc aac gee 



240 

get tec 



aac teg egg 



245 
tec 



cag ate 



Tyr 
250 

aag gee get ctg gac 

1171 ■ 

Lys Ala Ala Leu Asp 



He Ser Phe Asn Ala Ala Ser Asn Ser 

255 

aat gee ggc aag ate 



Arg 
260 

atg gcg ctg 



Ser Gin He 



acc aaa tec 



Asn 



265 

gcg ccc gac tac ctg gta 

1219 ■ 

Ala Pro Asp Tyr 
280 

aac cgc ate tac 

1267 ~ 



Ala Gly Lys .He Met 
270 

ggc ccc get ccg ccc 



Ala Leu Thr Lys Ser 
275 

gcg gac att aaa 



Leu 



cgc 



Val 
285 
ate 



Gly Pro Ala Pro Pro 

290 

ctg gag ctg aac ggc 



Ala Asp He Lys 



tac gaa cct gee 



835 



883 



931 



979 



f >■ 



e 



Jo 

M /?/> * < 

4 1 °" 



r 1 



Crro 



r 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/807,802 



DATE: 05/01/2001 
TIME: 11:39:02 



Input Set : A:\sequencelist.txt 

Output Set: N:\CRF3\05012001\I807802.raw 



E--> 



E--> 



149 Asn Arg 
150 

152 >gcc~ggc 
153 \1315 ^ 
154 Ala Gly 

..cuacc 

Asn Thr 



He 
tec 
Ser 
ate 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 




Tyr 
gtc 

Val 
315 
tgg 

Trp 

ate 

He 

aac 

Asn 

gag 

Glu 



Arg He Leu 
300 

ttt etc ggc 



Glu Leu Asn Gly Tyr Glu Pro Ala Tyr 

305 310 
tgg gee cag aaa agg ttc ggg aag cgc 



Phe Leu Gly Trp Ala Gin Lys Arg Phe Gly Lys Arg 

320 325 
ctg ttt ggg ccg gee acc acg ggc aag acc aac ate 

Leu Phe Gly Pro Ala Thr Thr Gly Lys Thr Asn He 

335 340 
gec cac gec gtg ccc ttc tac ggc tgc gtc aac tgg 



Ala His Ala 
350 

ttt ccc ttc 

Phe Pro Phe 
365 

ggc aag atg 



Val Pro Phe Tyr Gly Cys Val Asn Trp 

355 

aat gat tgc gtc gac aag atg gtg ate 

Asn Asp Cys Val Asp Lys Met Val He 

370 375 
acg gec aag gtc gtg gag tec gee aag 



Gly Lys Met 
380 



Thr Ala Lys Val Val Glu Ser Ala Lys 
385 390 
ggc ggc age aag gtg cgc gtg gac caa aag tgc aag teg 



E--> 



E--> 



E--> 



E--> 



18 7 ^tg- 
188 ^465 1 
189 > ^Met"cys 
190 425 
192 ccg- ttg 
193^1699^ 

194 PfoLeu 

195 440 

197 cat^gac 
198(^1747) 
199 HlsAsp 
200 
202 
20 
204 
205 
207 aag_ggt 
208 QL843J ) 
209 Lys Gly 



Ala 
cag 
Gin 
ttt 

Phe 



Gly 
395 
ate 

He 

gtg 

Val 

gac 

Asp 

ggc 

Gly 

gat 



Gly Ser Lys Val Arg Val Asp Gin Lys Cys Lys Ser 

400 405 
gac ccc acc ccc gtg ate gtc acc tec aac acc aac 

Asp Pro Thr Pro Val He Val Thr Ser Asn Thr Asn 

415 - 420 

att gac ggg aac age acc acc ttc gag cac cag cag 




He Asp Gly 
430 

egg atg ttc 

Arg Met Phe 

445 
aag gtg aca 

Lys Val Thr 
460 

cac gtg acc 



Asn Ser Thr Thr Phe Glu His Gin Gin 

435 

aaa ttt gaa etc acc cgc cgt ctg gag 

Lys Phe Glu Leu Thr Arg Arg Leu Glu 

450 455 
aag cag gaa gtc aaa gag ttc ttc cgc 

Lys Gin Glu Val Lys Glu Phe Phe Arg 

465 470 
gag gtg gcg cat gag ttc tac gtc aga 



His Val Thr Glu Val Ala His Glu Phe Tyr Val Arg 

480 485 
gga gec aac aaa aga ccc gee ccc gat, gac gcg gat aaa age 

Gly Ala Asn Lys Arg Pro Ala Pro Asp Asp Ala Asp Lys Ser 
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RAW SEQUENCE LISTING 
PATENT APPLICATION: US/09/807,802 

Input Set : A:\sequencelist.txt 

Output Set: N:\CRF3\05012001\l807802.raw 



DATE: 05/01/2001 
TIME: 11:39:02 



E--> 



E--> 




495 



500 



egg gec tgc ccc tea gtc gcg gat cca teg acg tea gac 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



220 520 

222 tcjt^cgt 

223 ^ 
224 
225 
227 
228 
229 
230 
232 
233 
234 
235 
237 

238^2131 
239 
240 
242 
243 

244 Arg Ala 

245 600 

247 ctg gat 

248 (5231 
^"eu^Asp 




W- 
E- 



W- 
E- 



W- 
E- 



W- 
E- 



> 
> 



> 
> 



> 
> 



> 
> 



Pro 
gac 
Asp 
tat 




w--> 



263 /2375 J) 

264 Gin Gin 

265 660 

267 tac'-ctc 

268 ^423 X 

269 Tyr Leu 
270 



Trp 
aag 
Lys 
gga 

Gly 



Arg Ala Cys Pro Ser Val 

510 

get ccg gtg gac ttt gec 

Ala Pro Val Asp Phe Ala 
525 

gcg ggc atg ctt cag atg 

Ala Gly Met Leu Gin Met 
540 

aat cag aat ttc aac att 

Asn Gin Asn Phe Asn lie 
555 560 
gag tgc ttc ccc ggc gtg 

Glu Cys Phe Pro Gly Val 

575 

acg tat egg aaa etc tgt 

Thr Tyr Arg Lys Leu Cys 

590 

gag att get tgc teg gee 

Glu lie Ala Cys Ser Ala 
605 

tgt gtt tct gag caa taa 

Cys Val Ser Glu Gin 
620 

ctt cca gat tgg etc gag 

Leu Pro Asp Trp Leu Glu 

635 

tgg gac ttg aaa cct gga 

Trp Asp Leu Lys Pro Gly 

650 

cag gac gac ggc egg ggt 

Gin Asp Asp Gly Arg Gly 
665 

ccc ttc aac gga etc gac 

* 

Pro Phe Asn Gly Leu Asp 
680 



Ala Asp Pro Ser Thr Ser Asp 
515 

gac agg tac caa aac aaa tgt 

Asp Arg Tyr Gin Asn Lys Cys 
530 535 
ctg ttt ccc tgc aag aca tgc 

Leu Phe Pro Cys Lys Thr Cys 
545 550 . 

tgc ttc acg cac ggg acg aga 

Cys Phe Thr His Gly Thr Arg 

565 

tea gaa tct caa ccg gtc gtc 

Ser Glu Ser Gin Pro Val Val 

580 

gee att cat cat ctg ctg ggg 

Ala lie His His Leu Leu Gly 
595 

tgc gat ctg gtc aac gtg gac 

Cys Asp Leu Val Asn Val Asp 
610 615 
atgacttaaa ccaggt atg get gee 

Met Ala Ala 
625 

gac aac etc tct gag ggc att 

Asp Asn Leu Ser Glu Gly lie 

640 

gec ccg aag ccc aaa gee aac 

Ala Pro Lys Pro Lys Ala Asn 
655 

ctg gtg ctt cct ggc tac aag 

Leu Val Leu Pro Gly Tyr Lys 
670 675 
aag ggg gag ccc gtc aac gcg 

Lys Gly Glu Pro Val Asn Ala 
685 690 
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05/01/2001 
11: 39 :02 



E--> 



W- 
E- 



W- 
E- 



W- 
E- 



W- 
E- 



W- 
E- 



W- 
E- 



W- 

E- 



W- 

E- 



W- 
E- 



W- 
E- 



W- 
E- 



W- 
E- 



> 
> 



> 
> 



> 
> 



> 
> 



> 
> 



> 
> 



> 
> 



> 
> 



> 
> 



> 
> 



> 
> 



> 
> 




RAW SEQUENCE LISTING DATE 
PATENT APPLICATION: US/09/807,802 TIME 

Input Set : A:\sequencelist.txt 

Output Set: N:\CRF3\05012001\I807802.raw 



gcg gcc etc gag cac gac aag gec tac gac cag cag etc 



278^519^ 
279 ^Lys Ala 
280 

cag 




287 cga -^gca 
288(2615^) 
289Vrg Ala 
290 740 
gtt 



Gly 
710 
gag 

Glu 

gtc 

Val 



Ala Ala 
695 

gac aat 
Asp Asn 
cgt ctg 
Arg Leu 
ttc cag 
Phe Gin 
ggc get 



Leu Glu His Asp Lys Ala Tyr Asp Gin Gin Leu 

700 705 
ccg tac ctg egg tat aac cac gcc gac gcc gag 




Pro Tyr Leu 
715 

caa gaa gat 

Gin Glu Asp 
730 

gcc aag aag 

Ala Lys Lys 
745 

aag acg get 



Arg Tyr Asn His Ala Asp Ala Glu 

720 

acg tct ttt ggg ggc aac etc ggg 

Thr Ser Phe Gly Gly Asn Leu Gly 

735 

egg gtt etc gaa cct etc ggt ctg 

Arg Val Leu Glu Pro Leu Gly Leu 
750 755 
cct gga aag aaa cgt ccg gta gag 



Lys Thr Ala Pro Gly Lys 'Lys Arg Pro Val Glu 

765 770 
cca gac tec tec teg ggc ate ggc aag aca ggc 

Pro Asp Ser Ser Ser Gly lie Gly Lys Thr Gly 

780 785 
aag aga etc aat ttt ggt cag act ggc gac tea 



Lys Arg Leu 
795 

cca caa cct 

Pro Gin Pro 
810 

act aca atg 

Thr Thr Met 
825 

ggc gcc gac 



Asn Phe Gly Gin Thr Gly Asp Ser 

800 

etc gga gaa cct cca gca ace ccc 

Leu Gly Glu Pro Pro Ala Thr Pro 

815 

get tea ggc ggt ggc gca cca atg 

Ala Ser Gly Gly Gly Ala Pro Met 
830 835 
gga gtg ggt aat gcc tea gga aat 



Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn 

845 850 
aca tgg ctg ggc gac aga gtc ate ace ace age 

Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser 

860 865 
ttg ccc ace tac aat aac cac etc tac aag caa 



Leu Pro Thr 
875 

acg ggg gcc 



Tyr Asn Asn His Leu Tyr Lys Gin 

880 

age aac gac aac cac tac ttc ggc 
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W- 



W- 

E- 



W 

E 



W- 
E- 



E- 



W- 
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W- 
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W- 
E- 



E- 



W- 
E- 



W- 
E- 




Input Set : A:\sequencelist.txt 

Output Set: N:\CRF3\05012001\I807802.raw 

33^3047 J) 

334^T±er^er Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His Tyr Phe Gly 
-> 335 885 890 895 

-> 3 37 tac—aac acc ccc tgg ggg tat ttt gat ttc aac aga ttc cac tgc cac 

338^3095^3 

339 Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His 
-> 340 900 905 910 915 

-> 3 42_£tt_tca cca cgt gac tgg cag cga etc ate aac aac aat tgg gga ttc 

Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe 

920 925 930 

ccc aag aga etc aac ttc aaa etc ttc aac ate caa gtc aag gag 

Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin Val Lys Glu 
935 940 945 

-> 352 gte-a^g acg aat gat ggc gtc aca acc ate get aat aac ctt acc age 

353 

3 54 Tal Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser 
-> 355 950 955 960 

-> 3 57 a^g—gtt caa gtc ttc teg gac teg gag tac cag ctt ccg tac gtc etc 

358 (f^jj) 

359 Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu 
-> 360 965 970 975 

-> 362xgtjcr~tct gcg cac cag ggc tgc etc cct ccg ttc ccg gcg gac gtg ttc 

36^211^ 

3 64* Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe 
-> 365 980 985 990 995 

-> 3 67 atg-^att ccg caa tac ggc tac ctg acg etc aac aat ggc age caa gee 

368 (3383 J 

3 69-^tetrlle Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ala 
-> 370 1000 1005 1010 

-> 372 g*$~~Qga cgt tea tec ttt tac tgc ctg gaa tat ttc cct tct cag atg 

373 ^(3431 . 

3 74 Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met 
-> 375 1015 1020 1025 

-> 377 >etg^a^a acg ggc aac aac ttt acc ttc age tac acc ttt gag gaa gtg 

378^3479^ 

379^LeiT*Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu Glu Val 
-> 380 1030 1035 1040 

-> 3 82 / etftT~ttc cac age age tac gcg cac age cag age ctg gac egg ctg atg 

383 ( 352 7y 

384 v Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg- Leu Met 
-> 385 1045 1050 1055 

-> 3 8 7^aa~t-^cct etc ate gac caa tac ctg tat tac ctg aac aga act caa aat 

388 3575J) 

3 89^&sn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg Thr Gin Asn 
-> 390 1060 1065 1070 1075 

-> 3 92^-eag— tqc gga agt gee caa aac aag gac ttg ctg ttt age cgt ggg tct 

39(3 3623 
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W 
E 



W 
E 



W 
E 



W 
E 



W 

E 



W 
E 



W 
E 



W 

E- 



E- 



W- 
E 



W- 
E- 



W- 
E- 





r Arg Gin Gin Arg Val Ser Lys Thr Lys Thr Asp Asn Asn Asn Ser 
1110 1115 1120 

ttt acc tgg act ggt get tea aaa tat aac etc aat ggg cgt gaa 



394 Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser Arg Gly Ser 
--> 395 1080 1085 1090 

--> 397 jcea— g^t ggc atg tct gtt cag ccc aaa aac tgg eta cct gga ccc tgt 

398( 3671 

399^rrr""Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro Gly Pro Cys 
--> 400 1095 1100 1105 

--> 402 tai-cag cag cag cgc gtt tct aaa aca aaa aca gac aac aac aac age 
403 
404 
405 
--> 407 
408 

409v£sn Phe Thr Trp Thr Gly Ala Ser ' Lys Tyr Asn Leu Asn Gly Arg Glu 
--> 410 1125 1130 1135 

■-> 412 t£C— ate ate aac cct ggc act get atg gee tea cac aaa gac gac gaa 

413/5815 J 

414(-Ser--rfe He Asn Pro Gly Thr Ala Met Ala Ser His Lys Asp Asp Glu 
■-> 415 1140 1145 1150 1155 

--> 417 gaj3— aag ttc ttt ccc atg age ggt gtc atg att ttt gga aaa gag age 

418 

419 £sp'"Lys Phe Phe Pro Met Ser Gly Val Met He Phe Gly Lys Glu Ser 
■-> 420 1160 1165 1170 

■-> 422 gGe-~grga get tea aac act gca ttg gac aat gtc atg att aca gac gaa 

423(3911^) 

424 fflai Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met He Thr Asp Glu 
■-> 425 1175 1180 1185 

■-> 427 jgag-^aaa att aaa gec act aac cct gtg gee acc gaa aga ttt ggg acc 

42 ^395 ^3 

429 Glu Glu He Lys Ala Thr Asn Pro Val Ala Thr Glu Arg Phe Gly Thr 
-> 430 1190 1195 1200 

-> 43 2 atg-~gca gtc aat ttc cag age age age aca gac cct gcg acc gga gat 

433(7007} 

434 vaTAla Val Asn Phe Gin Ser Ser Ser Thr Asp Pro Ala Thr Gly Asp 
-> 435 1205 1210 1215 

-> 43 7 g£g— eat get atg gga gca tta cct ggc atg gtg tgg caa gat aga gac 

438(^4055/ 

439 ValHis Ala Met Gly Ala Leu Pro Gly Met Val Trp Gin Asp Arg Asp 
-> 440 1220 1225 1230 1235 

-> 442 jgtxf^-tac ctg cag ggt ccc att tgg gee aaa att cct cac aca gat gga 

443 [4103) 

444VaT~Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His Thr Asp Gly 
-> 445 1240 1245 1250 

-> 44 7 cae-^ttt cac ccg tct cct ctt atg ggc ggc ttt gga etc aag aac ccg 

44^ 415l3 

44 9MTis Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys Asn Pro 
-> 450 1255 1260 1265 

-> 452>*cc"t^cct cag ate etc ate aaa aac acg cct gtt cct gcg aat cct ccg 

453(4199) 

4'54 Vro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asn Pro Pro 
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PATENT APPLICATION: US/09/807 , 802 TIME: 11:39:02 
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W- 



W- 
E- 



W- 



W- 
E- 



W- 
E- 



> 
> 



> 
> 



> 
> 



> 
> 



> 
> 



W- 
E- 

E 

E 

E 

E 



> 
> 

--> 

--> 
--> 



455 1270 1275 

457^<rcg~gag ttt tea get aca aag ttt get tea ttc 

45^ 4247^) 

459"^TaGlu Phe Ser Ala Thr Lys Phe Ala Ser Phe 
460 1285 1290 

4 62 ae_a_aga caa gtg agt gtg gaa att gaa tgg gag 

463(4295^) 

464 TWGly Gin Val Ser Val Glu He Glu Trp Glu 

465 1300 1305 1310 
4 67 agc_aag cgc tgg aat ccc gaa gtg cag tac aca 

4 68/<343 ^) 

469- v SeT'"'Lys Arg Trp Asn Pro Glu Val Gin Tyr Thr 
470 1320 1325 

4 72 tcjt_gcc aac gtt gat ttt act gtg gac aac aat 
473 
474 
475 

477 act 

478J4440 

4 79 Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 

1350 1355 
tt aat caata aaccggttga ttcgtttcag ttgaactttg 




a Asn Val Asp Phe Thr Val Asp Asn Asn 
1335 1340 
cgc ccc att ggc acc cgt tac ctt acc cgt 




1280 

ate acc caa tac tec 

He Thr Gin Tyr Ser 
1295 

ctg cag aaa gaa aac 

Leu Gin Lys Glu Asn 

1315 

tec aat tat gca aaa 

Ser Asn Tyr Ala Lys 

1330 

gga ctt tat act gag 

Gly Leu Tyr Thr Glu 
1345 

ccc ctg taattacgtg 

Pro Leu 
1360 

gtctcctgtc cttcttatct 



teggttae catggttata gcttacacat taactgcttg gttgegctte gcgataaaag 



aegtea tegggttace cctagtgatg gagttgecca ctccctctct gcgcgctcgc 
teggtg gggectgegg accaaaggtc cgcagacggc agagctctgc tctgccggcc 

icaccgagcg agegagegeg cagagaggga gtgggcaa 

4718 

767 <210> SEQ ID NO: 4 

<211> LENGTH: 1872 

<212> TYPE: DNA 

<213> ORGANISM: AAV-1 

<220> FEATURE: 

<221> NAME/KEY 

<222> LOCATION 



768 
769 
770 
772 
773 
774 

776 <400> SEQUENCE 

777 atg ccg ggc ttc 
Pro Gly Phe 



779 
780 
782 
784 
785 
787 
789 
790 
792 



Met 
1 

gag cac 
Glu His 



ctg 
Leu 



aag gaa tgg 
Lys Glu Trp 
35 

gag cag gca 



ccg 
Pro 
20 
gag 
Glu 



CDS 

(1) 
4 

tac 
Tyr 
5 

ggc 
Gly 

ctg 
Leu 



(1869) 



gag 
Glu 

att 
He 

ccc 
Pro 



ate 
He 

tct 
Ser 



gtg 
Val 

gac 
Asp 



ccc ctg acc 



ccg gat 
Pro Asp 
40 

gtg gec 



ate 
He 

teg 
Ser 
25 
tct 
Ser 



aag gtg 
Lys Val 

10 
ttt gtg 
Phe Val 

gac atg 
Asp Met 



gag aag ctg 



ccg age gac 
Pro Ser Asp 

age tgg gtg 
Ser Trp Val 
30 

gat ctg aat 
Asp Leu Asn 
45 

cag cgc gac 



ctg gac 48 
Leu Asp 
15 

gee gag 96 
Ala Glu 

ctg att 144 
Leu He 

ttc ctg 192 



/?<?k 

*p The types of errors shown exist throughout the Sequence Listing. 

subsequent sequences for similar errors. 
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Input Set : A:\sequencelist.txt 

Output Set: N:\CRF3\05012001\I807802.raw 

L:13 M:270 C: Current Application Number differs, Replaced Application Number 
L:14 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:122 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 1027 SEQ:1 
M:254 Repeated in SeqNo=l 
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9 RH 

i JU 
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x II 




Dorr i /"\n 

rvfc: y L\Jll f 


otju 


J- D . 




T . 

Ll . 


z _> J 


JYL . 


jjD 


w • 
w . 


1 IJ V a. HQ 


runino 


HC1U 




i 

1 U 


s-\ s~l A Y\ /*r 

Luuing 


Key ion / 




Tn ■ 

LU . 


i 


Ll . 


9 n 

Z D \J 


JYI , 


-5 J O 


W . 


inva ixa 


HJll-HlvJ 


fttlU 


KT l -1 m V"\ ip" 

IN UlllJJcX 


» 


^OUlIly 


Key ion / 




Tn ■ 
LU . 


i 
i 


Ll . 


0 R 
Z O D 


\x . 

M , 


-DOC 

jjD 


hi . 
W . 


L nva L 1Q 


HJUlIlU 
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■ 
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Key ion / 




Tn * 

LU . 


i 


J-l . 


o 7 n 

Z / \J 
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W . 


l nva l lu 
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T T^ 

in 
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t n • 
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1 
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Z / D 
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JJD 


w : 
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Aiuino 
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XT i i tti r*s 


T T^ 

in 


uoQing 
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1 


T . 
J-J . 


o q n 
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JYL . 


JJD 


TaT • 


mvai lu 
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in 
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1 


T . 
Ll . 
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± nvai ia 


TV TT\ T 

AJuino 
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in 


t v-\ ^a» 

touiny 


Kegion ^ 


C A 


id : 


1 


T . 

J_l . 


Z y V 


jyi . 


"3 "3 
JJD 


w . 


i n vai lu 


Amino 


A A 

ftClu 




■ 

in 


LOuiny 


i^S /*T T /— V T*\ 

Kegion / 


c irn 
oJjU 


IU . 
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Z J D 
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Invalid 
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Number 


in 


Coding 
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SEQ 
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1 


L: 
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M: 


336 
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Invalid 
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Acid 


Number 


in 


Coding 


Region, 


SEQ 
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M: 
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SEQ 
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Coding 
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in 
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M: 
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Number 
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Region, 
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in 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/807,802 



DATE: 05/01/2001 
TIME: 11:39:03 



Input Set : A:\sequencelist.txt 

Output Set: N:\CRF3\05012001\I807802.raw 



L:470 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:475 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:480 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:877 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:1008 SEQ : 4 
M:254 Repeated in SeqNo=4 

L:1207 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:1008 SEQ:6 
M:254 Repeated in SeqNo=6 

L:1505 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:1008 SEQ:8 
M:254 Repeated in SeqNo=8 

L:1913 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 1008 SEQ: 12 
M:254 Repeated in SeqNo=12 

L:2302 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 1008 SEQ:14 
M:254 Repeated in SeqNo=14 

L:2624 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:1008 SEQ:16 
M:254 Repeated in SeqNo=16 

L:2858 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 1020 SEQ:18 
M:254 Repeated in SeqNo=18 

L:3102 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 1020 SEQ: 19 
M:254 Repeated in SeqNo=19 
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